The Kullback-Liebler Divergence as a Lyapunov Function for Incentive Based Game Dynamics
نویسنده
چکیده
It has been shown that the Kullback-Leibler divergence is a Lyapunov function for the replicator equations at evolutionary stable states, or ESS. In this paper we extend the result to a more general class of game dynamics. As a result, sufficient conditions can be given for the asymptotic stability of rest points for the entire class of incentive dynamics. The previous known results will be can be shown as corollaries to the main theorem. 1 Information Theory and The Replicator Dynamics Information theory was originally developed by Claude Shannon and Warren Weaver [Sha01, SW49] as a mathematical framework to describe problems in communication including, but not limited to, data compression and storage. They introduced measures of information called entropy. Shannon’s entropy, denoted H(P ), is a measure of the average uncertainty in a random variable, P . It can be interpreted as the average number of bits needed to encode a message drawn i.i.d. from P. Maximizing the entropy can be used to give a lower bound on this average number of bits needed for encryption. For our purposes, the concepts of cross entropy and relative entropy will be of great use. The Kullback-Leibler divergence (KL divergence or DKL) [KL51], or relative entropy is a measure of information gain (loss) from one state to another. More precisely, it is an average measure of the additional bits needed 1In fact, the Shannon entropy is simply the Boltzmann entropy [Jay65] without the constants 1 ar X iv :1 20 7. 00 36 v1 [ m at h. D S] 3 0 Ju n 20 12 to store y given a code optimized to store x. It is defined as
منابع مشابه
Microeconomic Theory and the Kullback-Liebler Discrepancy: Some Remarkable Connections
Using a well-known utility of wealth function, the classic contingent claims model is shown to produce measures of expected wealth, certainty equivalence, gains from trade, and risk premia with relative entropy interpretations, as measured by the KullbackLiebler discrepancy. Author
متن کاملLyapunov Functions for Evolutionary Dynamics on Time Scales and Riemannian Geometries
We combine incentive, adaptive, and time-scale dynamics to study multipopulation dynamics on the simplex equipped with a large class of Riemannian metrics, simultaneously generalizing and extending many dynamics commonly studied in dynamic game theory and evolutionary dynamics. Each population has its own geometry, method of adaptation (incentive), and time-scale (discrete, continuous, and othe...
متن کاملAutomatic concept identification in goal-oriented conversations
We address the problem of identifying key domain concepts automatically from an unannotated corpus of goal-oriented human-human conversations. We examine two clustering algorithms, one based on mutual information and another one based on Kullback-Liebler distance. In order to compare the results from both techniques quantitatively, we evaluate the outcome clusters against reference concept labe...
متن کاملModel Confidence Set Based on Kullback-Leibler Divergence Distance
Consider the problem of estimating true density, h(.) based upon a random sample X1,…, Xn. In general, h(.)is approximated using an appropriate in some sense, see below) model fƟ(x). This article using Vuong's (1989) test along with a collection of k(> 2) non-nested models constructs a set of appropriate models, say model confidence set, for unknown model h(.).Application of such confide...
متن کاملAdaptive Game Playing Using Multiplicative Weights
We present a simple algorithm for playing a repeated game. We show that a player using this algorithm suffers average loss that is guaranteed to come close to the minimum loss achievable by any fixed strategy. Our bounds are nonasymptotic and hold for any opponent. The algorithm, which uses the multiplicative-weight methods of Littlestone and Warmuth, is analyzed using the Kullback–Liebler dive...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1207.0036 شماره
صفحات -
تاریخ انتشار 2012